Tooling the Lexicon Acquisition Process for Large-Scale KBMT

نویسندگان

  • John R. R. Leavitt
  • Deryle W. Lonsdale
  • Kevin Keck
  • Eric Nyberg
چکیده

Large-scale lexical knowledge acquisition is one of the most time critical steps in developing a knowledge-based machine translation system. In particular, developing the syntactic lexicon for the target language can be an unwieldy task, as on-line knowledge assets are likely to be more scarce than for the source language. This paper addresses this problem within the KANT machine translation system and describes how we structure the KA process to address this problem. This was done by first determining the nature of the desired process and then developing tools to implement that process. The tools themselves and the ways in which the helped us to realize out design goals are described. We conclude that, while the problem of lexical acquisition can be formidable, it can be overcome with proper foresight and tool design.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automated Corpus Analysis and the Acquisition of Large, Multi-Lingual Knowledge Bases for MT1

Although knowledge-based MT systems have the potential to achieve high translation accuracy, each successful application system requires a large amount of hand-coded knowledge (lexicons, grammars, mapping rules, etc.). Systems like KBMT-89 and its descendents have demonstrated how knowledge-based translation can produce good results in technical domains with tractable domain semantics. Neverthe...

متن کامل

Automated Corpus Analysis and the Acquisition of Large, Multi-Lingual Knowledge Bases for MT

Although knowledge-based MT systems have the potential to achieve high translation accuracy, each successful application system requires a large amount of hand-coded knowledge (lexicons, grammars, mapping rules, etc.). Systems like KBMT-89 and its descendants have demonstrated how knowledge-based translation can produce good results in technical domains with tractable domain semantics. Neverthe...

متن کامل

A Lexicon for Knowledge-Based MT

In knowledge-based machine translation (KBMT), the lexicon can be specified and acquired only in close connection with the specification and acquisition of the world model (ontology) and the specification of the text meaning representation (interlingua) language. The former supplies the atoms for the specification of text meaning and provides world knowledge to support the inference processes n...

متن کامل

Acquisition of Large Lexicons for Practical

Although knowledge-based MT systems have the potential to achieve high translation accuracy, each successful application system requires a large amount of hand-coded lexical knowledge. Systems like KBMT-89 and its descendents have demonstrated how knowledge-based translationcan produce good results in technical domainswith tractable domain semantics. Nevertheless , the magnitude of the developm...

متن کامل

Building a Large-Scale Knowledge Base for Machine Translation

Introduction Linguistic Resources Knowledge-based machine translation (KBMT) systems have achieved excellent results in constrained domains, but have not yet scaled up to newspaper text. The reason is that knowledge resources (lexicons, grammar rules, world models) must be painstakingly handcrafted from scratch. One of the hypotheses being tested in the PANGLOSS machine translation project is w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994